Statistical speech translation system based on voice recognition optimization using multimodal sources of knowledge and characteristics vectors

نویسندگان

Alejandro Canovas

Jesús Tomás

Jaime Lloret Mauri

Miguel Garcia

چکیده

Synergic combination of different sources of knowledge is a key issue for the development of modern statistical translators. Reconnaissance, and thus the translation, can be improved by adding new heuristic characteristics. In this work, a speech translation statistical system that adds additional other-than-voice information in a voice translation system is presented. The additional information serves as a base for the loglinear combination of several statistical models. We obtain the characteristics vectors using a statistical model that is based on the N-best reconnaissance list. We describe the theoretical framework of the problem, summarize the overall architecture of the system, and show how the system is enhanced with the additional information. Our real prototype implements a real-time speech translation system from Spanish to English that is adapted to specific teaching-related environments. Finally, we will provide and explain the system performance results. A tool like the one presented in this article may increase the participation rate of the foreign students to the lecture classes and talks. Keywordsspeech recognition; speech translation; adaptation; pedagogical tool.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

متن کامل

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

A preliminary study on improving the recognition of esophageal speech using a hybrid system based on statistical voice conversion

In this paper, we propose a hybrid system based on a modified statistical GMM voice conversion algorithm for improving the recognition of esophageal speech. This hybrid system aims to compensate for the distorted information present in the esophageal acoustic features by using a voice conversion method. The esophageal speech is converted into a "target" laryngeal speech using an iterative stati...

متن کامل

Blind Voice Separation Based on Empirical Mode Decomposition and Grey Wolf Optimizer Algorithm

Blind voice separation refers to retrieve a set of independent sources combined by an unknown destructive system. The proposed separation procedure is based on processing of the observed sources without having any information about the combinational model or statistics of the source signals. Also, the number of combined sources is usually predefined and it is difficult to estimate based on the ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Computer Standards & Interfaces

دوره 35 شماره

صفحات -

تاریخ انتشار 2013

Statistical speech translation system based on voice recognition optimization using multimodal sources of knowledge and characteristics vectors

نویسندگان

چکیده

منابع مشابه

Voice-based Age and Gender Recognition using Training Generative Sparse Model

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

A preliminary study on improving the recognition of esophageal speech using a hybrid system based on statistical voice conversion

Blind Voice Separation Based on Empirical Mode Decomposition and Grey Wolf Optimizer Algorithm

عنوان ژورنال:

اشتراک گذاری